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Which geometries on a smooth manifold (apart from Lorentzian metrics) can serve 
as a spacctimc structure? This question is comprclicnsivcly addressed from first prin- 
ciples in eight lectures, exploring the kinematics and gravitational dynamics of all 
tensorial geometries on a smooth manifold that can carry predictive matter equa- 
tions, are time-orientable, and allow to distinguish positive from negative particle 
energies. 
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INTRODUCTION 

The recent announcement of superluminal neutrino propagation by the OPERA collab- 
oration is a reminder that the spacetime geometry might well not be given by a Lorentzian 
manifold. But by what else? Fortunately, this question has a rather comprehensive answer. 
For we will show in these lectures that the spectrum of tensor fields that can serve as a space- 
time geometry — in the sense that matter field dynamics is predictive and observers agree on 
the sign of particle energies — is severely restricted. Only geometries on which the dispersion 
relation of matter fields is encoded in a totally symmetric contravariant even-rank tensor field 
P satisfying three simple algebraic conditions - it must be hyperbolic, time-orientable and 
energy-distinguishing - can present candidates for a spacetime geometry. These physically 
inevitable properties single-handedly ensure that the entire kinematical apparatus familiar 
from physics on a Lorentzian manifold is defined precisely the same way for any spacetime; 
causality, in particular, is perfectly compatible with superluminal propagation in spacetimes, 
but one only learns this from a subtle interplay of convex analysis, real algebraic geometry 
and the modern theory of partial differential equations. The relevance of this class of ge- 
ometries roots in that fact that they already exhaust the possible spacetime structures on a 
smooth manifold. Thus this class, and only this class of geometries, merits our attention. 

Lorentzian manifolds simply present the simplest example of a tensorial spacetime geom- 
etry. But considering any one of the more refined geometries of course eradicates the very 
foundation of Einstein's field equations as we know them. It does not eradicate, though, 
the deeper principle behind them that was revealed by the Wheeler school a long time ago: 
in their geometrodynamical view, gravitational dynamics is all about evolving the spatial 
geometry from one suitable initial data surface to an infinitesimally neighbouring one, such 
that ultimately all spatial geometries recombine to an admissible spacetime geometry. The 
geomctrodynamic principle stands independent of the particular spacetime geometry it is 
applied to. Thus the second key insight arrived at in these lectures is that finding the 
gravitational dynamics of refined spacetime geometries does not require inspired physical 
guesswork, but is reduced to solving a clear-cut mathematical problem. It remains of course 
for experiment to decide which member of the only countable class of classical spacetime 
theories is realized in Nature. 
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LECTURE I: MANIFOLDS 

Every study needs to start from foundations that are not further questioned. For the 
present lectures, this is the assumption that spacetime is certainly a smooth d-dimensional 
manifold M. For the benefit of the non-specialist, this first lecture recalls the relevant 
definitions from topology and differentiable manifold theory. 

Definition. A set M is made into a topological space (M, O) by choosing a collection O of 
(then-to-be-called open) subsets of M, provided the choice has been made such that 

1. the trivial subsets are open: 0, M e O , 

2. finite intersections of open sets are open: U,V e O implies U DV e O , 

3. arbitrary unions of open sets are open: Ua & O for all a e A implies UaeA Ua & O . 

Renicirks. 

1. The choice of topology for any given set is far from unique. For sets of cardinality from 

one to seven, there are 1, 4, 29, 355, 6942, 209527 and 9535241 topologies, respectively, 
and this number increases rapidly with growing cardinality. 

2. The coarsest topology any set M can be equipped with is O = {0, M} (there is 
obviously no topology with fewer open sets), while the finest topology is O = V{M) = 
{U \U C M} (there is obviously no topology with more open sets). Useful choices for 
topologies usually lie between these extremes. 

3. Only for finite sets M may one provide a topology O by writing down the complete 
list of sets one chooses to be open. For inifinite sets one needs to resort to an indirect 
definition of the open sets and then prove that their collection O indeed makes M into 
a topological space. This is the case for the important example that follows. 

Definition. A subset U of M'^ is called open in the standard topology Og on M"^ if for every 
X = {x^, . . . , x'^) e U there exists a positive real e such that the ball 

d 

B,{x) = {{y\ ...,/) e I Y.{x' - f f < e} 

1=1 
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lies entirely within U . 

Reiiicirk. We will tacitly assume in the following that R*^ is equipped with the standard 
topology, unless explicitly stated otherwise. 

Exercise. Prove that (R'', Og) is a topological space. 

Definition. A map (f) : M ^ N between two topological spaces (M, 0m) and {N,0n) is 
called continuous if preim^(y) = {x E M \ (f){x) E V} E 0m for every V e 0n- A bijection 
(f) is called a homeomorphism if both (p and 4>~^ are continuous. Two topological spaces 
between which there exists a homeomorphism are called homeomorphic. 

Remarks. 

1. It is easy to see that any map cf) : M ^ N is continuous if N is equipped with the 
chaotic topology or if M is equipped with the discrete topology. This is an example 
of the earlier claim that useful topologies lie somewhere between these extremes. 

2. The definition of continuity of a function / : R — > R reduces to the elementary e-S- 
criterion from undergraduate analysis. 

Definition. A topological space (M, 0) is called a d-dimensional topological manifold if for 
every p e M there exists an open set U 3 p and a homeomorphism :[/—>■ 0(C/) C R*^. 
The component functions (f?- , . . . , (f)'^ : U — > R are then called the coordinate functions on U, 
and for every point qEU, the d-tuple {(p^iq), ■ ■ ■ , (t>'^{q)) is called the coordinates of q, with 
respect to the chart (C/, 0). 

Remarks. 

1. Covering a manifold by charts is what sailors do. This allows to study objects on the 
manifold in terms of their chart representatives. For instance, a curve 7 : R — >■ M (the 
course of the ship) on the manifold (in the real world) is represented in a chart (C/, 0) 
by the curve : preim^(?7) — )■ 0(?7) with 7,/, = o 7 (the course drawn on the paper 
chart). 

2. In general a single chart does not suffice to cover the entire manifold, as is well-known 
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for maps of the Earth. In general one needs several overlapping charts, and one needs 
to know how to make transitions between these charts. 



Definition. Two charts {U, 0) and {V, ip) of a topological manifold are called C'^-compatible 
if either (a) t/ fl = or (b) n V 7^ and the chart transition map 

which by construction is map between open subsets on R'^, is invertible, and both the 
transition map and its inverse are A;-times continuously differentiable. A C^-atlas is a family 
of mutually C'^-compatible charts {Ua,(f)a)aeA with M = [J^^^Ua- A C'^-atlas is called 
maximal if any chart (A^, z/) that is C'^-compatible with all charts in the atlas is already 
contained in the atlas. A topological manifold (M, 0) equipped with a maximal C'^-atlas is 
called a C*^- (differentiable) manifold. 

Remcirks. 

1. Starting from a maximal C°-atlas, charts need to be removed in order to obtain a 
maximal C^-atlas, and even more charts need to be removed to obtain a C^-atlas, 
and so forth. In practice, however, one simply specifies a maximal C'^-atlas A by 
specifying some (rather minimal than maximal) C'^-atlas and then declaring any 
further chart that is C*^-compatible with every chart in ^0 an element of A. 

2. On a C^-manifold {M,0,A) one may now define a curve 7 : M — )■ M to be 
m-times continuously differentiable, or C*^, at parameter value to G M, if for some 
chart {U, (p) with 7(^0) G U the chart representative 7^ is m-times continuously differ- 
entiable in as a curve on R*^. It is then precisely the mutual C'^-compatibihty of all 
charts in the atlas A that guarantees that also the chart representative 7^ of the curve 
with respect to any any other chart {V,ip) with 7(^0) £ V is m-times continuously 
differentiable at to, since 

7^ = ^07 = ^0 0^07 = (^o (f)-^) o 7^ 

and the chart transition map ip o cf)"^ is /c-times continuously differentiable. Hence 
the above definition of m-times differentiable curves is independent of the actual chart 
representative, and thus well-defined. 
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3. It is easy to see that any two charts on a topological manifold are automatically C°- 
compatible. Thus every topological manifold is a C'^-manifold, and vice versa. In 
particular, it depends on one's taste of whether one wishes to decide the continuity of 
a curve, say, directly at the level of the topological manifold in terms of its topology 
or, alternatively, at the level of charts in terms of the continuity of some chart rep- 
resentative. In contrast, C'^-differentiability for /c > can no more be decided at the 
topological manifold level, but one must descend to the level of charts. 

Exercise, (a) Provide a definition of an m-times continuously diffcrcntiable function / : 
M — )> M on a C'^-'^-manifold and show that the definition is independent of the choice of 
chart, (b) Show that the set C'^(M) of /c-times continuously differentiable functions is made 
into a real vector space, where addition of two functions and multiplication of a real number 
with a function are defined pointwise. 

Example. We illustrate the abstract theory developed here for the example of the real plane 
M — M.^, which we make into a topological space by equipping it with the standard topology. 
But then it is already a topological manifold of dimension 2 that can be covered by a single 

chart {U, (p) with f/ = and : f/ — )■ defined by (f)^{x, y) = x and (f)'^{x, y) = y, which is 
of course continuous in the standard topology. Thus we have an atlas A consisting of only the 
chart ([/, 0), which is trivially C°°-compatible with itself, so that ^ is a C°°-atlas. Extending 
^ to a maximal atlas by assuming that any other chart (V, ip) that is C°°-compatible to (C/, 0) 
is also contained, we have thus constructed a C°°-manifold (R^, O, ^max)- An interesting 
example of another chart in AmsLx is V = R^\{(s, 0) | s G [0, oo)} and -0 : y ^ R^ defined 
by %l)^{x,y) = ^/ x'^ + y'^ and 'i/j'^{x,y) = aicta.n{y / x) . The chart transition map is then 

00 V^-i : R+ X (0,27r) ^ R2\{(0,s) |s G [0,oo)}, (0 o ^-i)(r, = {r cos ip,r sin ip) , 

which is clearly invertible and C°°. Hence indeed, {V,ip) is contained in the maximal atlas 
we constructed. 
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LECTURE II: TENSORS 



Without additional structure on a smooth manifold, the most general type of data one 
can define on a differentiable manifold are tensor fields, whose definition and properties we 
concisely introduce in this second lecture. Indeed, it will be tensor fields that will serve in the 
next lecture as the mathematical objects encoding prototypical matter and the fundamental 
geometry on a smooth manifold. 

Definition. Let M be a C°°-manifold and 7 a C^-curve on M. Then the tangent vector 7p 
to the curve 7 at the point p — j{tp) is the linear map 



Remarks. 

1. The sum of two tangent vectors at the same point p, defined by its action on an arbi- 
trary -function / as 

{ip®^p)f = ipf + Kf^ 

is again a tangent vector, i.e., cxp = jp(BSp for some curve a : M — )> M through p. Such 
a curve is readily constructed using some chart {U, (p) with p e t/; for 

a{t) := ((0 o 7) W + (0 o 6){t) - {cj> o 7)(g) 

one has <j{tp) = p and for any C^-function / 

crj = daif o r'mp)) [^(tp) + ^(^p)) = V + V 

according to the mult i- dimensional chain rule (and summing over repeated indices) 
for the function f o <p~^ and curves (f) o 5 and o 5 on M*^. Similarly, one defines the 
S'-multiplication of real number with a tangent vector and shows that the result is 
again a tangent vector. 

2. The set TpM of all tangent vectors through a point p, equipped with ® and consti- 
tutes a real vector vector space, the tangent space to M at p. 
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3. A chart {U, 0) induces at each point p e M a particular basis of the tangent vector 
space TpM, for then 

ipf = ytp) = — — {tp)da{f o <t> ){(t>{p)) , 

and defining the chart-dependent derivative operators ■ C^{M) ^ M by their 

action 



on any C^-function / on M, we find that 



7p 



d 



Thus we see that a chart {U, 4>) gives rise to the chart-induced basis 



d 



d 



in each tangent space TpM ii p G U, and the components of the tangent vector with 
respect to this basis arc given by the derivative of the chart-representative 7^ with 
respect to its curve parameter. 

4. Under a change of chart from {U, 0) to (V, ■0), the chart- induced basis vectors change 
by the hnear transformation 



d 



(r^<t>\ \m 
y^'4)\p) a 



d 



90^ 



where (C^|p)™a = <9a(0 ° '0 Accordingly, the components of a tangent vector 

in the new basis are 

-itp), 



dt 



dt 



where it should be noted that it now reads C^\p instead of C^|p, and that these two 
matrices are related by inversion, (C'f |p)™'a(Cf'|p)"„ = 5™. 



Definition. The differential dpf of a Ci-function / on M is the linear map 



dpf:TpM^R, dpf{X)^Xf. 
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Remcirks 

1. Given a chart {U,(f)), the components (j)"- : U — )> M with a = !,...,(/ of the chart 
map are C^-functions, and thus the differentials dp(j)^, . . . , dp(f)'^ are d elements of the 
so-called cotangent space T*M at p, the dual vector space to TpM. 

2. The action of the differentials of the components of the chart map act on the chart- 



induced basis of TpM as 



d 



d_ 



Thus we find that the dp4>^, . . . , dpcf)'^ constitute a basis of T*M, namely the dual basis 
to the chart-induced basis on the tangent space TpM. 

3. Expanding a vector v G TpM and a co- vector cu e T^M in the basis and dual basis 
induced by a chart {U, (f)) containing p, 

. d 



one finds that in terms of components, 



and u! — u'''adp4)"' , 



uj{X) = utdpriv^' 



_d_ 



4. Under change of chart from (C/, 0) to with both containing the point p, the 

differentials transform as 

dpr ^ {ci\p)\dp4>' 



Definition. On a C^-manifold M, a tensor of valence (r, s) at a point p e M is an element 
t of the vector space 



{TpM)i = t;m 



r times 



'T*M®TpM' 



TpM . 



s times 



The components of t with respect to a chart (t/, 0) containing the point p are the real 
numbers 



bi. 



,,^^t{dpr\...,dpr', 



d 



d 



and the tensor can be reconstructed from its components by virtue of 



J. J. ai...ar 

t — t^ bl...6. 



d 



d 



dpcf)''^ (8) • • • (g) dpcf)" 
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Remcirks. 

1. Recall that for finitc-dimcnsional real vector spaces V and W, and v & V and w € W, 
the tensor product (of vectors) v <S)W is the bilinear map 

V <S) w : V* X W* R, {v(g> w){iy, jj) = v{h')w{jji) , 

and the tensor product (space) is the real vector space 

V ®W ^ {a'^Vi (^Wj\i^l,..., dim V; j ^1,..., dim W} 

(where vi, . . . , Vdimv and wi, . . . , Wdimw are bases of V and W, respectively), equipped 
with the addition of bilinear maps and their multiplication with real numbers. 

2. According to the above definition, (TpM)^ equipped with the addition of multi-linear 
maps and their multiplication with real numbers is a vector space of dimension 
(dimM)''+*. Further, a co- vector u e T*M is a tensor of valence (0,1). By virtue 
oi V — (V*)* for any finite-dimensional vector space V, one also sees that a vector 
X e TpM is a tensor of valence (1,0). It is customary to say that a scalar (real 
number) is a tensor of valence (0, 0) at p. 

3. Under a change of chart from (C/, 0) to (V^t/j), the components of a tensor of valence 
(r, s) change as 

■/■ ai...ar (r-<^\ai ( rii>\ar ( r~i4>\ni (r^4>\ns j. mi...mr 

l^tl} bi...bs — K'-^S ) mi ■ ■ ■ K'-^ d, ) rrirK'-^th) bi ■ ■ ■ K'-^-dj) bs^i) m.-.n^ ■ 



Definition. The tensor bundle (TM)^ over a manifold M is the set 

{TM): = {(p, t)\peM and t e (T,M);} 

together with the canonical projection map tt : TM — > M defined by 7r(p, t) — p. A 
tensor field T is a map T : M ^ {TM)l for which (tt o T){p) = p for all points pE M. 

Remcirks. 

1. A chart (t/, 0) induces on its domain the (l,0)-tensor (vector) fields 

^ -.u^iTuy,, p^ (p, ^ 
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and the (0, l)-tensor fields (co- vector) 

2. Expressing a tensor field T is terms of its component functions with respect to the 
above tangent and cotangent basis fields, 

d d 

T is called a C^'-tensor field if the above tensor component functions T"-'^-^''bi...bs '■ 
4>{U) M are m-times continuously differentiable. 



LECTURE III: HYPERBOLIC GEOMETRIES 



In this lecture, we start our investigation of which geometries on a smooth manifold can 
serve as a geometry of spacetime. We will start from the most general assumption of the 
geometry being given by some arbitrary tensor field, and identify crucial properties such a 
tensor field must satisfy in order to present a viable spacetime geometry. This will culminate 
in our final definition of a spacetime in lecture V. 

Definition. A geometry on a smooth manifold M is given by a smooth tensor field G, 
which may be restricted by further conditions. 

Examples. 

1. The traditionally most intensively studied geometry is Riemannian geometry, given 
by a (0, 2)-tensor field g that is restricted to be symmetric {g{X, Y) — g{Y, X) for all 
vectors X, Y at each point) and positive definite (g(X, X) > for every non-vanishing 
vector X at each point). The spacetime geometry in general (and hence special) 
relativity is a Lorentzian geometry, given by a (0, 2)-tensor g that is symmetric and 
of signature (l,dimM — 1). 

2. The geometry of the phase space of classical mechanical systems carries a symplcctic 
geometry, given by a (0,2)-tensor field u that is restricted to be anti-symmetric 
{u!{X,Y) — —uj{Y,X) for all vectors X,Y at each point), non-degenerate {uj{X,Y) 
for all vectors X at a point already imphes that Y — 0.) and closed {^u!bc+ ^i^ca + 
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3. As one example for a spacetime geometry beyond Lorentzian manifolds, we will meet 
area metric geometry, given by a (0, 4) -tensor field G that features the symmetries 

G{X,Y,A,B)^G{A,B,X,Y) and G{A,B,X,Y) ^ -G{B,A,X,Y) 

and is non-degenerate in the sense that G{A, B, X,Y) — for all linearly independent 
pairs of vectors A, B already implies that the pair of vectors X, Y is linearly dependent. 
Employing the techniques we will develop in these lectures, we will see that in order to 
serve as a physically viable spacetime structure, an area metric needs to be restricted 

by further algebraic conditions (which are precisely the same deep conditions that lead 
to the Lorentzian signature condition in the metric case). 



Remcirks. 

1. A geometry is called flat if there exists an atlas such that in each chart the component 
functions of the tensor field G are constant. Unless explicitly stated, however, in these 
lectures we will not make the assumption that the geometry (M, G) is fiat. 

2. In Riemannian geometry (M, g), a necessary and sufficient criterion for fiatness is that 
the Riemann-Christoffel tensor associated with the metric g vanishes. For a symplectic 
geometry {M,uj), in contrast, it is a fundamental result that one can always find 
"Darboux" charts, where the components of the symplectic form uj are constants, so 
that every symplectic manifold is flat. 



Definition. Probing matter on a smooth manifold M equipped with a geometry G is given 
by a tensor field $ that takes its values in some vector subspace F of a {TM)g and satisfies 
a linear partial differential equation 



Mn" (■^) ^ii ■ ■ ■ ^i, 



n=l 



= 0, 



where the coefficients Q are constructed solely from the components of G and its partial 
derivatives such that (a) the entire equation transforms as a tensor and (b) the initial-value 
problem is well-posed. Here the small Latin indices run over 1, . . . , dimM and the capital 
Latin indices run over 1, . . . , dim V. 
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Remcirks. 

1. The terminology "probing matter" refers to the hnearity of the differential equation. 
For we will see that studying the properties of a tensor field governed by a linear 
differential equation, one can learn important lessons about the underlying geometry. 
While at first sight one could learn similar lessons starting from non- linear equations, 
it would often be impossible to disentangle the properties of the matter field from 
properties of the underlying geometry. 

2. A restriction of $ to a proper vector subspace V is effected by linear conditions on 
such as symmetry conditions. For instance, while a generic (0, 2)-tcnsor $ takes its 
values in a (dim M) ^-dimensional tensor space, a symmetric one takes its values in a 
subspace V with dimV^ — dim M (dim M + l)/2. 

3. Under a change of chart, only the highest order coefficient Q^mn" transforms as a tensor 
with respect to all its small and capital Latin indices. The lower order coefficients 
however generically pick up additional terms involving higher order coefficients, and 
thus do not transform as tensors. 

4. The initial-value problem is well-posed if there exist hypersurfaces such that the pre- 
scription of initial data on these surfaces uniquely determines, by virtue of the above 
differential equation, the value of $ at every point of the manifold. This prediction of 
the future, or post-diction of the past, is the essence of classical physics. 

Theorem. If the hnear partial differential equations of the previous definition have a well- 
posed initial value problem, then 

P{x, k) = ±p det [Q'^f {x)h, . . . ki^l 

defines a positive hyperbolic homogeneous polynomial in each cotangent space T*M, where p 
is a scalar density constructed from the geometric tensor G such that P{x, k) is a co-tangent 
bundle function and 

1. homogeneous means that for any A and any co- vector k e T*M one has P{x,Xk) — 
^deg Pp(^x, k), where degP is called the degree of P 
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2. hyperbolic means that there exists at least one co- vector h (called a hyperbohc co- 
vector of P at X e M) with P{x, h) such that for every co-vector q e T*M the 
polynomial equation 

P{x,q + \h) = 

in the real number A has deg P many (and thus only) real solutions. 

3. positive means that the sign ± is chosen such that P{h) > 0. 




FIG. 1. A CO- vector is hyperbolic if any line in its direction intersects the vanishing set of the 
homogeneous polynomial P exactly degP many times; shown examples are for degree two and 
four, respectively. 

Remarks. 

1. The scahng of the co-tangent bundle function introduced by some choice of scalar den- 
sity p does not affect the hypcrbolicity condition, or in fact any of the other conditions 
we will reveal in the course of these lectures to be imposed on a geometry to define a 
bona fide spacetime structure. 

2. Only a hypersurface E whose co-normals at each point y E T, (i.e., co-vectors n G 
T*M such that for every smooth curve 7 on E through y, n{jy) — 0) are hyperbolic 
co-vectors can serve as initial data surfaces for the underlying field equations. The 
condition that the polynomial be hyperbolic thus amounts to the condition that there 
be any viable initial data surfaces. 
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3. The requirement that P be hyperbohc clearly imposes a restriction on the underlying 
geometry G. One main topic of these lectures is to study the full extent of this 
restriction, and to employ these insights in order to identify all geometries that can 
serve as a classical spacetime structure. 

Example. For the Klein-Gordon equation on a Lorentzian manifold {M,g), 

g''\x)dad,^x) - ^g'^{x)g^'ix)idagsb{x) + d^gM - d,gab{x))d^^{x) + m''^{x) = , 

one finds P{x, k) — g^''{x)kakb, which is hyperbolic if and only if g is of Lorentzian signature 
(1, d — 1) or (d — 1, 1). The positivity requirement then narrows this down to the signature 
(1, 0? — 1). This means that the Klein-Gordon equation cannot have a well-posed initial value 
problem unless formulated on a Lorentzian geometry. The hypersurfaces with hyperbolic 
co-normal are then precisely the so-called spacclikc hypersurfaces, and indeed it is a well- 
known fact that every initial data surface for the Klein-Gordon equation is spacelike (but the 
converse does not hold). The previous theorem generalizes this to any tensorial geometry. 




FIG. 2. Hyperbolicity cones C(P, h) for the previously shown examples of degrees two and four, 
respectively. 

Definition. If /i is a hyperbolic co-vector of P, then there is an entire connected set of 
hyperbohc co- vectors, the so-called hyperbohcity cone C(P, h) around h. The hyperbohcity 
cone is explicitly obtained by the following construction. Prom the coefficients ho, ... , /ideg p 
of the expansion 



P{x, q + Xh)^ ho{x, q, h) A'^^s-P + h^i^^^ f,) ^degP-i ^ . . . ^ h^^^pix, q, h) 
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one constructs the matrices Hi, H2, . . . , H^egp as 



H,{q,h) 



h 


h?, 


^5 • • 


• ^2i-l 


ho 


h2 




• ^2i-2 





hi 


hs .. 


■ ^2i-3 





ho 


h2 .. 


■ ^2i-4 








.. 


• hi 



where hj is set to for j > i , 



Then the hyperbohcity cone around h is the set 

C(P, h) = {h' e T*M I det ^) > for alH = 1, . . . , deg P} . 



Remctrks. 

1. Prom its definition, it is clear that every h' e C{P, h) represents the same hypcrbolicity 
cone as h does, C{P,h') = C{P,h), and that any hyperbohcity cone is open in the 
standard topology on cotangent space. Carding further showed that C{P, h) is indeed 
a convex cone since for every positive real 

\h' e C(P, h) and h' + h" e C(P, h) , 

positive real A and any h' , h" e C(P, h). 

2. While P is, by definition, positive on the entire cone C(P, h), it can be shown to vanish 
on its boundary dC. 

3. If P factorizes into lower degree polynomials 

P(x, k) = Pi(x, ky^ . . . Pf(x, kyf , 

then P is hyperbolic with respect to h if and only if every Pi, . . . ,Pf is hyperbolic 
with respect to h. For technical reasons, we will assume in the following that unless 
ai — • • • — af — 1, we will replace P at each point x & M by the reduced polynomial 

P{x,k) ^Pi{x,k)...Pf{x,k). 

In any case, for the hyperbohcity cones we have 

C(P, h) = c(Pi, /i) n • • • n c{Pf, h) . 
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Example. For a hyperbolic polynomial P{k) — g~^{k,k) defined in terms of a Lorentzian 
metric of signature (H — •••—), one finds that any co- vector h with g''^{h,h) > is a 
hyperbolic co-vector. Picking one such h, the expansion 

Piq + Xh) = X'g-\h, h) + \2g-\q, h) + g-\q, q) 

identifies the quantities 

ho(q,h) ^ g~^(h,h), hi(q,h) ^ 2g-^(q,h), h2(q,h) ^ g''^(q,q) 

and thus 

'2g-\q,h) 



deti/i(g,/i) = det [2^"^(g,/i)] and detH2{q,h) 



g \h,h) g \q,q) 



so that the hyperbolicity cone of P that contains h is the set 

C{P, h) = {h' e T*M I g-\h', h') > and g-^{q, /i) > 0} . 



LECTURE IV: MASSLESS DISPERSION 

In this lecture we reveal that the hyperbolic polynomial provides the dispersion relation 
for the underlying matter field equations in the high-energy (or, equivalcntly. massless) 
limit. We employ this insight to derive the action for a massless point particle in terms of 
an associated dual polynomial, which in turn will play an important role in the definition 
of spacetimes. 

Definition. A family locally wavehke solutions of linear matter field equations is a family 
of solutions of the form 

= ^^(a;)e'^(^^/^ for 0(x) e U , 

where (?7, 0) is some conveniently small chart of the manifold, A is a positive real parameter, 
the phase function 5 is a smooth real function whose differential dS is everywhere non-zero, 
and the real amplitude tensor field components admit Taylor expansions of the form 

oo 

A^ = Y^ a^{x)X' , where ao(x) ^ 0. 

n=0 
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Remcirks. 

1. A hypersurface along which S — const, is called a wave front , and the differential 
dpS/ A the wave co-vector at p E U. In the course of this lecture, we will learn how to 
associate a ray vector with each wave co- vector in suitable geometries. 

2. Insertion of the locally wavelike solution ansatz into the partial differential field equa- 
tions yields 

QMN'{x)dn ■ ■ ■ di,S{x) a^{x) + (. . . )mA + (. . . )mX'' + • • • = , 

which shows that for a locally wavelike solution one needs, to lowest order in A, 

det{Q'--^'{x)d,,Six)...d,^S{x))^0. 

In terms of the cotangent bundle function P defined in the previous lecture, this means 
that to lowest order in A, the wave co- vector k — dS/ A of a locally wavelike solution 
must satisfy the dispersion relation 

P{x,k) = 0. 

3. By taking into account more than just the lowest order, or even all orders in A, the 
above dispersion relation may or may not be modified if one finally takes the high- 
frequency limit A — )■ 0. If such a modification occurs in the high-frequency limit, we 
call the underlying field equations massive , and otherwise massless . 

Example. We illustrate the derivation of the dispersion relation for a given field equation 
for the example of a Klein-Gordon equation in flat Lorentzian spacetime, which in a suitably 
chosen chart takes the simple form 

rfhdadb(^{x) - m^4){x) = , where m > . 

Insertion of the locally wavelike solution ansatz yields 

-r^'^^'daSd^SA^ + [2i'n''''daSd^Ax + i-q'^^'dad^Ax] A + [r]"'' dad^A^ + m^A^] A' = 

so that S{x) = kaX"' solves the equation to lowest order in A for any co-vector k with 
rj^^kakb — 0, and the equation reduces to 

oo 
j=0 
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where we now expanded the functions ^4;^ in terms of the functions aj and defined a_i = 0. 
This is solved for arbitrary A by 



1 / irn^Lx'^ 



for any real constant a and co- vector I such that r]°'Halb — but r]"'Hakb 7^ . In order to 
keep ^4;^ real, one needs to absorb the phase into the phase function S and thus obtains 

But from this we obtain, having taken into account all orders of A, the dispersion relation 

P{x,dS{x)) = m\ 

Thus we identify the Klein-Gordon equation with m = as massless, and the Klein-Gordon 

equation with m > as massive. 

Definition. The set of massless momenta at a point x e M is the cone 

N^^{ke T*M I P(x, k)^0}, 





FIG. 3. Massless momenta for a polynomial of degree two and four, respectively. 
Remarks. 

1. For technical precision, we will occasionally focus on the smooth subcone 

^smooth _ 1^ g I jjp(^^^ k)y^O}, 

where DP denotes the derivative of the reduced P with respect to the cotangent fibre. 
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2. While the massless momentum cone at each point is determined by the polynomial 
P(x, •), the converse question - namely under which conditions the massless momen- 
tum cone Nx at a point x determines the polynomial up to a constant factor, is 

subtle, but of central importance. Indeed, it can be shown that for a cone A^^; of mass- 
less momenta, the polynomial is determined up to scale, since the real NuUstellensatz 



holds for any reduced hyperbohc polynomial P. Here for any subset S C T*M oi 
cotangent space, X{S) denotes the set of all polynomials on T*M that vanish on S. 
This real NuUstellensatz will become important shortly. 

In order to associate velocity vectors with massless particle momenta in physically mean- 
ingful fashion, we employ the dynamics of free massless point particles. 

Theorem. The action of a free and massless point particle is 



where the function A is a Lagrange multiplier. 
Reiiicirks. 

1. It is clear by construction that the above action describes a free and massless point 
particle. Note, in particular, that the geometry enters the action only through the 
dispersion relation enforced in the Lagrange multiplier term. 

2. In the following, we wish to eliminate the momentum q and the Lagrange multiplier 
A to obtain an equivalent action in terms of the particle trajectory x only. Variation 
of the Helmholtz action with respect to A of course enforces the null condition for 
the particle momentum. Now variation with respect to q yields x — \DPx{q) for all 
q e 7V^™°°*^, which implies the weaker equation 



where [X] denotes the projective equivalence class of all vectors coUinear with the 
vector X. It is this latter equation that we will need to to invert in order to eliminate 
the momentum from the above Helmholtz action. 



= {aP, I a G M} 
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Definition. The polynomial Pf : T^M — > R is called dual polynomial to an irreducible 
polynomial P : T* ^ R if 

P#(L>P^(Ar^'"°°*'^)) = 0, 

and is thus determined up to a real scale. For a polynomial that is reducible into 
irreducible factors we define the dual polynomial as the product 

(Pi(a;, k)--- Pf{x, k))*{x, v) = Pf{x, v) ■ ■ ■ Pf{x, v) , 

of the duals of the irreducible Pj, whence P* is uniquely determined up to a real scale 
and also satisfies the previous equation. 

Renicirks. 

1. The dual null cone N^, defined as the image of the massless co- vector cone A^^. under 
the gradient map 

dP 

DP:N,^T^, k^—{x,k), 
is the vanishing set of P*. This is the geometric meaning of the dual polynomial. 

2. The existence of a dual P*, and indeed its algorithmic computability for any reduced 
hyperbolic polynomial P, ultimately hinges on the real NuUstellensatz mentioned be- 
fore. The the degree of P*, however, is generically different from the degree of P. 

3. In principle, the construction of the dual polynomial to a reduced hyperbolic polyno- 
mial may always be performed using Buchberger's algorithm. The bad news is that, in 
practice, such a direct calculation of dual polynomials of higher degree and in several 
variables (that is precisely the cases we arc interest in) using elimination theory ex- 
hausts the capability of current computer algebra systems. The goods news, however, 
is that in some cases of physical interest one is nevertheless able to guess the dual 
polynomial by physical reasoning and then to readily verify it mathematically. In any 
case, since a dual polynomial always exists for the reduced hyperbolic polynomial we 
are considering here, we will simply assume in the following that a dual P* has been 
found by some method. 



Definition and Theorem. The Gauss map 

[DP] : [TV—] ^ [N*], [q] ^ [DP,{q)] 
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and the dual Gauss map 

[DP*\ : [N*''^°°^^\ [N^l [X\ ^ [DP*{X)] 
are partial inverses of each other, in the sense that for null co- vectors k e N^°°^^ 

[DP*]{[DP,m)) = [k] if dei{DDP,){k) + , 
and similarly with P and exchanged. 

Proof. Writing the defining equation for the dual polynomial in the form 

P*{x, DP{x, k)) = Q{k)P{k) for all co-vectors k , 

(since this form does not require an explicit restriction to null co- vectors), differentiation 
with respect to k yields, by application of the chain rule and then of Euler's theorem 

DP{k)k = {deg P)P{k) 

for the homogeneous function P on the right hand side, for any null co- vector k satisfying 
the non-degeneracy condition det{DDPx){k) ^ that 

DP*{x,DP{x,k)) = ^^^k, 

which in projective language is the statement of the theorem. 

Remcirks. 

1. We may thus solve, up to a real scale, the momentum-velocity relation [£)P-r(?)] = 
[i/A] for the massless particle for the momentum, 

[g] = [DP#]([x/A]), 

and obviously the homogeneity of DPf in conjunction with the projection brackets 
allows to disregard the function A altogether. Translating this result back to non- 
projective language, another undetermined function appears, 

q^p.DP*{x). 
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2. This reveals the physical meaning of the Gauss map [DP^] and its inverse [DP*]: up 
to some irrelevant conformal factor, they associate null particle momenta in TV^mooth 
with the associated null particle velocities in N* 

3. Replacing the momentum in the action for the massless particle, and using again 
Euler's theorem, but now applied to the homogeneous polynomial P*, one obtains 
the massless point particle action 



The automatic appearance of a final Lagrange multiplier also hardly comes as a 
surprise, since it is needed to enforce the null constraint Pf{x) = 0. This reveals the 
direct physical relevance of the dual tangent bundle function P* as the tangent space 
geometry seen by massless particles. 



Finally we are in the position to write down a definite definition of what constitutes a 
spacetime geometry. The requirements of that definition are, on the one hand, necessary 
in order to ensure that the equations for the probing matter are predictive and that (as we 
will show) all observers agree on the distinction of particles and anti-particles; on the other 
hand, these requirements will be sufficient to develop the full kinematical apparatus known 
from general relativity, but for any spacetime geometry in the general sense discussed here. 

Definition. The polynomial P^ : T^M — > R is called bi-hyperbolic if both P^ and its dual 
polynomial P* are hyperbolic polynomials. 



1. Hyperbolicity of a polynomial on tangent space is defined precisely as hyperbolicity of 
a polynomial on cotangent space, but with all co-vectors replaced by vectors. 

2. Hyperbolicity does not imply bi- hyperbolicity. However, Hyperbolic polynomials 
P{k) — g~^{k,k) of degree two (necessarily defined by an inverse Lorentzian metric 
g~l) present an exception; their duals P#(X) = g{X,X) are hyperbolic if and only if 




LECTURE V: SPACETIMES 



Renicirks. 
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P is hyperbolic. This is simply due to the obvious fact that a metric g has Lorentzian 
signature if and only if its inverse has Lorentzian signature. 

Definition. Let T be a nowhere vanishing smooth vector field on M such that for each 
point X & M the vector T{x) lies within one of the hyperbolicity cones of P*, which cone 
we then denote by C*. The thus defined smooth distribution of cones 

C* = {{x,C*)\xeM} 

is then called a time orientation of the manifold (M, G). 




FIG. 4. Time-orientation C# in tangent space for polynomials of degree two and four, respectively. 



Remcirks. 

1. A time orientation is needed in order to have a meaningful definition of observers. 
More precisely, we require that the worldline x : M — )■ M of an observer have tangent 
vectors Xx{T)h € ^^t) parameters r e K. This amounts to requiring that 
observer travel into future directions defined by C*. 

2. A full definition of observers, which includes not only the above constraint on the 
tangent vectors of their worldlines, but also a definition of purely spatial directions 
seen by them, needs to be postponed until we developed more mathematical technology 
in the next chapter. 

3. Already with the partial definition of observers given above, one may define the energy 
of a CO- vector seen by an observer. More precisely, let q E T*M and X e C* the 
tangent vector to an observer's worldline. Then the (observer-dependent) energy of 
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the CO- vector is q{X). Physically, it will of course only be meaningful to speak of the 
energy of a co-vector that represents a particle momentum, and we will restrict to 
those cases later. 

Definition. The cone of positive energy co-vectors at a; e M is the set 

{C*)+ = {pe t;m I p{X) > for all X e C*} . 

Renicirks. 

1. Thus at any point x e M all observers X e Cf agree on the sign of the energy of 
any given co- vector q e C*)"*" at the same point x e M. This is in fact precisely the 
rationale behind the definition of the cone Cf. 

2. If the reduced hyperbohc polynomial P is reducible into factors 

P{X,k)^Pi{x,k) Pf{x,k), 

then the positive energy cone (C*)+ is the sum of the positive energy cones {Cf)~^, . . . , {C^) 
of the irreducible factors, 

= (cf )+ + • • • + {cfy , 

where the sum of two subsets in tangent space is defined as the set of the sum of any 
two elements of the two sets. 

Definition. A bi-hyperbolic polynomial Px is a called energy-distinguishing if the set A^^ = 

{k e T*M I P{k) = 0} of massless momenta is the disjoint union 

Nx = n: u at- , 

of the set A^ = A" n (C*)"*" of positive energy massless momenta and the set A^ — N f] 
— (C*)+ of negative energy massless momenta. 
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Remcirks. 



1. For matter on geometry {M,G) whose principal polynomial Px at each point x E M 
is an energy-distinguishing and bi-hyperbolic polynomial, all observers agree on the 
sign of the energy of any massless particle momentum. This allows, for instance, for a 
unique positive and negative energy split of fundamental solutions if the geometry G 
is flat. 

2. Energy-distinguishing hyperbolic polynomials are of even degree. This is seen as fol- 
lows. First, one proves that bi-hyperbolicity of implies that 



Let ko be such that ko e closure ((C*)"*") and ko e — closure ((C*)"'" ). It follows from 
the definition of the dual cone that the following inequalities are true for all x e 
C* : x.ko > and x.ko < 0. If this were true then the hyperbohcity cone C* 
would have to be a plane or a subset of a plane. That would contradict the property 
of Cf to be open. Second, suppose that the zero set A^^^ contains a plane. From 
closure((C#)+) n -closure((C#)+) = {0} it follows that (C#)+ \ {0} is a proper 
subset of a half-space. A proper subset of a half-space cannot contain any complete 
plane through the origin. Hence the existence of a null plane of Px would obstruct the 
energy-distinguishing property. Third, this fact immediately restricts us to cotangent 
bundle functions P of even degree. For suppose deg P was odd. Then on the one hand, 
we would have an odd number of null sheets. On the other hand, the homogeneity of 
P implies that null sheets in a co-tangent space come in pairs, of which one partner 
is the point reflection of the other. Together this implies that we would have at least 
one null hyperplane. 

Definition. A geometry (M, G, C*) is called a spacetime with respect to matter dynamics 



closure((C#)+) n -closure((C#)+) = {0} . 



if 



1 



P is everywhere bi-hyperbolic. 



2 



G* is a time-orientation defined in terms of P, 



3 



P is everywhere energy-distinguishing with respect to the time-orientation. 
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where P : T*xM — > R is the cotangent bundle function defined by the principal polynomial 
of linear matter dynamics DMN{d)^^ = at each point x e M. 



1. Whether (M, G) presents a viable spacetime structure crucially depends on what linear 
matter equations one chooses to probe (M, G). To recognize this is not a weakness of 
the approach presented here, but rather presents a crucial insight. 

2. The restriction to linear matter dynamics is necessary in order to obtain a principal 
polynomial that depends on the geometry G only, but not on particular solutions of 
the matter field equations. For while non-linear matter equations may be linearized 
as $ = ^ exact + around an exact solution ^ exact, ^-nd the principal symbol P of 
the resulting linearized equations for still determines the causality of the theory, 
P will now in general depend on the exact solution ^exact around which the theory 
was linearized. Thus one cannot isolate purely geometric statements if one considers 
non-linear matter. The lesson is that a spacetime geometry is best probed by linear 
matter, and thus we focus on such. 

3. Lorentzian geometry {M,g) presents a spacetime with respect to Maxwell theory. For 
in Lorentz gauge, the Maxwell equations take the form 



from which one reads off at each x E M the principal polynomial Px{k) — g"'^{x)kakb, 
which is hyperbolic because the inverse metric g~^ has Lorentzian signature. The 
polynomial P*{X) := gabX"'X^ is then indeed dual to P, since 



for all k with P{k) = 0, and also hyperbolic since g has Lorentzian signature. Thus 
P is b-hyperbolic. It is also energy-distinguishing, since choosing one of the two 
hyperbolicity cones of as the time-orientation C*, one finds that 



where C is the hyperbolicity cones of P that lies entirely within (C#)+. But then 
N — U N" and P is also energy-distinguishing. 



Remcirks. 




P*{DP{k)) = gab{'2g''^km){'2g'^kr,) = g^^k^k^ = 



= {g e T;M I q{X) > for all X e C*} ^ closure(C) 
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LECTURE VI: MASSIVE DISPERSION 

Not only for completeness, but because of its crucial role in mapping massive co- vectors to 
associated velocity vectors, we study massive point particles on spacetimes in this lecture. An 
important corollary is a meaningful classification of all physically viable modified dispersion 
relations. 

Definition. Let C C (C*)"*" be a hyperbolicity cone of positive energy. Then any g e C is 
called a massive positive energy momentum, and its mass m > is given by P{q) = m'^^^^ . 

Remarks. 

1. There is always one hyperbolicity cone C of P that hes entirely within (C*)+. For 
we know that the boundary dC of any hyperbolicity cone is a connected set of P- 
null co-vectors, and the disjoint union (C*)"'"U — (C*)"'" covers the entire set of null 
co-vectors. Hence either C or —C lies entirely within the positive energy cone (C*)"'". 

2. The positivity of P on the cone C of massive positive energy momenta is guaranteed 
by our choice of the overall sign in the definition of P, in lecture III. 

Theorem. For a bi-hyperbohc and energy distinguishing P, the so-called barrier function 

: ^ K, Uq) = In P{x, q) 

is strictly convex and essentially smooth, which guarantees that the Legendre map 

: a ^ L,(a) C T,M, L,(g) := -D{\nP){x,q) 

is invertible, with 

given in terms of the Legendre transform 

: L,(a) ^ R, f^{X) = -L-\X)X - UL-\X)) 
of the barrier function /j., at each point x of the manifold M. 
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Remcirks. 

1. Essential smoothness refers to a particular behaviour of the barrier function close to 
the boundary of the convex set on which it is defined; see any text on convex analysis. 
The functions to be Legendre transformed in classical mechanics or thermodynamics 
are usually defined on an entire vector space, which is of course a convex set but has 
no boundary, so that the criterion of essential smoothness is trivially satisfied there, 
and thus less known. 

2. It is only the interplay of bi-hyperbolicity and the energy-distinguishing property that 
ensures the existence and invertibility of the above Legendre map. Thus all criteria 

for a spacctimc structure, as laid out at the end of the previous chapter, are required 
to have the present Legendre theory at our disposal, which in turn will now be used 
to derive the dynamics of massive particles. 

Theorem. The dynamics of a free positive energy particle of mass m > is encoded in the 
action 

S[x]=m J cirP,V)(i:(r))^/^^^^, 

where at each point x E M, the function P*{X) — Px{L~^ {X))~^ is defined on the entire 
cone Lx{Cx) in tangent space. 

Proof. It is obvious that the Helmholtz action 

QaX^ — Am In P(x, —) 
m . 

leads, upon variation with respect to A, to the dispersion relation for a positive energy 
particle momentum q of mass m, and together with the geometry- free term qaX^ presents 
the action for such a particle, very similar to the massless case discussed in lecture IV. 
Variation with respect to q then yields = (A deg P)La,(g/m), which we know how to 
invert due to the Legendre theorem given above, so that 

^ = "^^"^Ad|p)- 

Using the thus given relation between massive momenta and the tangent vector to the 
particle worldline, as well as the definitions of the barrier function and the Legendre dual. 



S[x, g. A] = / dr 



^ 
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one eliminates q from the action and obtains 



S[x,X] = -mdegP / dr Xf^{x/{XdegP)) = -mdegP / dr [A/^^(i;) + Aln(AdegP)] , 



where for the second equahty we used the easily verified scaling property f^{ax) = f^{x) — 
In a. Prom variation of this equivalent action with respect to A, one then learns that 



Using this twice, one has A/^(i;) + A ln( A degP)) = — A = — cxp(— /^(x) — 1)/ dcgP. Noting 
that because of i; e Lj.(Cj.) one also has L~^{x, x){x) — 1 and thus f^{x) — —l — fx{L~^{x)), 
one eliminates A to finally arrive at the equivalent action 



for a free point particle of positive mass m. This concludes the proof. 
Remarks. 

1. While the tangent bundle function P* is generically non-polynomial, it is elementary 
to see that it is homogeneous of degree deg P. 

2. The action we arrived at above is reparametrization invariant, as it should be. How- 
ever, parametrizations for which P{x, L~^{x, x)) = 1 along the curve are distinguished 
since they yield the simple relation 



between the free massive particle velocity x and the particle momentum q everywhere 
along the trajectory x. As usual, we choose such clocks and call the time they show 
proper time. 

3. Thus the physical meaning of the Lcgcndre map is established (namely mapping mas- 
sive positive energy particle momenta to the respective tangent vectors of their world- 
lines), and one may thus justifiably call the open convex cone Lx{Cx) the cone of 
massive particle velocities, and the function P* the massive dual of P, which indeed 
encodes the tangent bundle geometry seen by massive particles. 



/^(i;) + ln(AdegP) + 1 = 0. 



S[x] 




l/degP 



X = Lx{q/m) 
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4. Reassuringly, one can now prove that the observer cone hes in the massive dual, 
Cf C Lx(Cx). Thus one may think of observers as massive, as usual. The converse, 
however, does not hold, since the inclusion is generically proper. 

Definition. An observer is a curve e : M ^ LM in the frame bundle LM over M such that 

1. the first frame vector cq at each point coincides with the tangent vector to the canon- 
ically projected curve tt o e : M ^ M, 

2. eo at each point of the curve hes within the observer cone C* and 

3. L"^(eo)(ea) = for the remaining frame vectors Cq, with a — 1, ... , dimM — 1. 

Remarks. 

1. The vector subspace \4 = {X e T^M \ L~^(eo){X) = 0} contains the purely spatial 
directions seen by the observer. 

2. An observer frame in each tangent space along the curve tt o e induces a unique dual 
frame 

e' = L-\eo),e\...,e''-''-\ 

and the zero component Qq of a co- vector q — qoe'^ + Qa^"', with a sum over a — 
1, . . . , dimM — 1 understood, coincides precisely with the energy of that co- vector as 
seen by the observer. 

Theorem. The dispersion relation of causally propagating matter of mass m > 0, whose 
sign of energy is agreed upon by all observers, must take the form 

for a bi-hyperbolic and energy-distinguishing P, where e'^, . . . e^i™^-! is an observer co-frame. 
Solving the above relation for E then yields the relation between the energy E and the purely 
spatial momentum p^e'^ seen by the particular observer chosen in the decomposition of the 
massive positive energy momentum p & C. 
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Remcirks. 



1. Likewise one obtains the physicaUy meaningful massless dispersion relations, by letting 
p e and m — 0. 

2. Note that the polynomial P enters into the above decomposition twice: firstly in 
determining what constitutes, for a particular observer, a split into the purely temporal 
and purely spatial directions; and secondly in determining the dispersion relation as 
such. In order to play either of these roles, P must be bi-hyperbolic and energy- 
distinguishing, as we saw in these lectures. 

3. It is obvious that once the above relation has been solved for E in terms of the purely 
spatial components of the momentum, to take the form 



it is prohibitively difficult to recognize from the generically infinitely many coefficients 
c, c'*, c°^, . . . , whether the underlying polynomial P was bi-hyperbolic and energy- 
distinguishing (and thus whether the dispersion relation is one of causally propagating 
matter of definite sign of energy at all). But even if that were so, it appears rather 
questionable to attempt to bound the coefficients from comparison with experiment 
without understanding the structure of observer frames, for then the quantities E 
and p^e" are void of any meaning. Corresponding attempts in the phenomenology 
literature are thus to be severely doubted. 



Superluminal motion of massive particles can occur in all spacetimes with degP > 2. 
This is compatible with causality by construction in our spacetimes, but raises the question 
of what mechanism prevents ordinary observation of superluminal massive particles. In this 
lecture we find the answer to this question in the fact that superluminal massive particles 
are kinematically allowed to radiate off massless particles until they are infraluminal. 



oo 




with c — m, 



i=0 



LECTURE VII. SUPERLUMINALITY 



Theorem. The process where a positive energy massive particle of momentum p radiates 
off a positive energy massless particle at a point x in spacetime is kinematically forbidden 
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if and only if p lies in the cone 

l;\c*)cc. (1) 

in cotangent space. 
Remarks. 

1. The cone L^^{Cf) contains precisely those positive energy massive momenta that 
correspond to infraluminal massive particles, since the slowest light is the one on the 
boundary of C*. All other positive energy massive particles travel at a velocity higher 
than the slowest hght. Depending on the geometry, some may travel even faster than 
the fastest light. 

2. For the familiar Lorentzian metric case (degP = 2), one of course obtains that 
L~^{Cf ) = Cx, in other words, positive energy massive particles travel at a veloc- 
ity lower than the speed of light and cannot radiate off a massless particle in vacuo. 
This is often stated as that there is no vacuum Cerenkov radiation, which we now see 
is only true for Lorentzian spacetimes. 

3. The proof of the theorem is given in [A]. 

Reduction to 1+1 dimensions. In order to understand what precisely hes in store with 
the super luminal particles, without obscuring the essentially straightforward argument by 
cumbersome algebra, we consider the situation in 1 + 1 spacetime dimensions. Extension to 
the physically relevant 3 + 1 dimensions presents no conceptual challenges. 



C 




FIG. 5. Massless momenta, mass shell and construction of distinguished points thereon 
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Figure [5] shows the massless momenta in some cotangent space, together with the mass 
shell for mass m. Note that (because the Gauss map sends 'inner' massless cones in cotangent 
space to outer cones in tangent space, and vice versa) the inner cone here corresponds to 
fast massless particles, while the outer one corresponds to slow massless particles. 

Now we construct the points Ai and A2 as those points on the mass shell, where the 
slow massless momentum cone touches the mass shell. The relevance of these points is that 
any massive momentum of higher energy can radiate off a massless particle traveling at the 
speed of slow massless particles, see figure |6j Thus massive particles whose momenta on the 
mass shell lie between the points Ai and A2 are precisely those that cannot decay. Thus 
the straight lines connecting the origin with Ai and A2, respectively, must constitute the 
boundary of the cone identified above. 

O . 




FIG. 6. Momentum diagram for a massive particle radiating off one slow massless particle. 

Now construct the point C as the intersection of the slow massless cone centered at A2 
with the mass shell. Obviously any momentum on the mass shell that lies between the 
points Ai and C can radiate off at most one massless particle, since the outgoing massive 
particle momentum will then lie between A2 and Ai and is thus kinematically protected 
form further decay. Of course a similar point (not shown) could be constructed on the left 
half of the mass shell. 

Finally we obtain the point B (and a similar point on the left, not shown) on the mass 
shell as the intersection of the slow massless momentum cone centered at the point where 
the massive particle energy is the lowest with the mass shell. The significance of this point is 
that it marks the point where the outgoing massive particle ceases to still run into the same 
direction as the ingoing massive particle, since for any ingoing massive momentum between 
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the points B and C the outgoing particle momentum is going into the opposite direction. 

This leaves us with the following expected pattern of arrival of superluminal and infra- 
luminal particles at some detector if the emitted massive particles at some source had a 
momentum p, see figure [7| For the energy range m < E < Eia, between the rest mass m of 
the emitted particle and the energy Ei above which radiating off a massless particle becomes 
possible, the particle is stable and thus arrives at the detector with exactly the momentum 
it was emitted with. For the energy range Eia < E < Eify, where particles are necessarily 
superluminal and Eif, was constructed to be the energy above which the massive particle 
no longer travels in the same direction after it emitted the massless particle, one either 
detects the superluminal particle with unchanged momentum (because no massless particle 
was actually radiated off) or an infraluminal massive particle whose momentum is obtained 
from the construction in figure [6] as a function of the emitted particle momentum. In the 
energy region Eif, < E < E2, where E2 was constructed as the highest energy a superluminal 
particle can have in order to radiate off at most one massless particle, one however expects 
at most superluminal particles to arrive at the detector at precisely the momentum they 
were emitted with. For if a superluminal particle in this energy range were to radiate, its 
momentum would be reversed but then lie in the stable energy region m < E < Eia- Thus 
the prediction from this simple model in 1 + 1 dimensions is that there is an energy range 
within which no superluminal particles are detected. 

lb 2 



E, < E < E 

la 



m < E < E , 

la 



FIG. 7. Particles emitted from a source (left) at certain energy arriving at a detector (right); 
dashed lines are superluminal massive, solid lines infraluminal massive, wiggly lines slow massless 

Calculating the quantum mechanical decay rate on tensorial spacetimes is possible, but 
beyond the scope of these lectures. 
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LECTURE VIII: SPACETIME DYNAMICS 

The aim of this final lecture is to find dynamics that develop initial geometric data 
from one P-spacelike hypersurfaces to another, such that sweeping out the spacetime mani- 
fold this way, one reconstructs a bi-hyperbolic and energy-distinguishing dispersion relation 
everywhere. To this end, one studies hypersurfaces with hyperbolic co-normals by their 
embedding maps and thus calculates how functional of this embedding map change under 
normal and tangential deformations of the hypersurface. This change can be expressed by a 
linear action of deformation operators on such functionals, and it is the commutation algebra 
of these deformation operators that needs to be represented on the phase space of spatial 
geometries in order to obtain canonical dynamics for the (degP, 0)-tensor field P. 

Definition. Let X : E ^ M be a smooth embedding map of a smooth manifold E 
of dimension dimM — 1 with local coordinates into the smooth manifold M with 

coordinates {a;"}. Then the dimM vectors 

T{y) := Lxiy){n{y)), e^{y) := ■ ■ ■ , e^raM-i{y) := g^dimM-i g^ 

constitute a basis of each tangent space Tx{y)M of M along the hypersurface S, where the 
co-normals n{y) are uniquely determined by the conditions 

n e C, P{n) = 1, n{ea) = for all a = 1, ... , dimM - 1 . 

The first basis vector Lx{y){n{y)) is called the (spacetime) normal vector to the hypersur- 
face ^(E) at each point y G E, while the remaining basis vectors ei(y), • • • edimM-i(y) are 
(spacetime) tangent vectors to -'^(E). The uniquely determined co-basis be denoted 

n(y), e\y), e^-^-^(t/) . 

Renicirks. 

1. The spacetime geometry explicitly enters into the above definitions only in determining 
the hypersurface normal vector L(n), but there three-fold: first in determining the 
hyperbolicity cone C to ensure that the hypersurface in question can be an initial data 
surface, second in normalizing the canonically defined co-normal which is otherwise 
only determined up to positive rescalings, and third in mapping the co-normal to a 
normal vector by virtue of the Legendre map L. 
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2. The embedding map determines the purely spatial geometry 

P^^-'^'{y)[X] := P(6"Hy), . . . , e'^'{y)My). ■ ■ ■ Mv)) , / = 2, . . . , degP 

^ V ' ^ V ' 

/ degP-7 

on the hypersurface. At least this is the purely spatial geometry detected by point particles. 

3. One may instead also construct the purely spatial geometry seen by fields, by consid- 
ering all non-vanishing hypersurface tensor fields obtained by inserting the and n 
into the co- vector slots and Cq, and T into the vector slots of the geometric tensor G. 
The precise number and nature of the fields then depends on the precise structure 
(valence and algebraic symmetries) of G. We will thus focus in the following on the 
geometry seen by point particles, but the principle is the same for the geometry seen 
by fields. For an area metric geometry {G"'^'^'^ = G'^'^"'^ = — G''"'^'^), for instance, one 
obtains in d spacetime dimensions the three purely spatial tensor fields 

G"^"^ {y) [X] = G{n{y) , e"^ (y) ,n{y), e"^ (y) ) , 

4. In any case, note that the quantities defined under 2. and 3., describing the purely 
spatial geometry, are (i) tensors on E and (ii) fuuctiuuals of the embedding map. 
While similar quantities are defined for / = and 1=1, they are constantly 1 and 0, 
respectively, due to our definition of n. 

5. Assume we are already given a spacetime {M,G,C*) with an associated cotangent 
bundle P, and consider a hypersurface embedding X : S ^ M. Then one can study 
how functionals of X, such as the geometric data in the previous remark, change 
under deformations of the hypersurface. Such deformations in normal and tangential 
directions of the hypersurface are encoded by specifying a hypersurface-scalar field N 
and hypersurface- vector field N'^ta, respectively. The change of a functional F of X 
is then given, to hnear order, hy H{N)F and T>{N°'ea)F, where 

nN) = Jj'-'yN{y)T'^{y)j^ and V{N'^e^) = j^d'-'y N'^{y)el{y)j^^ 

are the normal and tangential deformation operator, respectively. 
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Theorem. The normal and tangential deformation operators, acting on functional on an 
initial data hypersurface, satisfy the hypersurface deformation algebra 

[H{N),'H{M)] = -P((degP - l)P''^{Md^N - Nd^M)dc) , 
[V(N-d^),n{M)] = -n(N-d^M) , 

[V{N''da),V{M^d^)] = -V{{N^dfsM'' - M^dpN'')da) ■ 

Reiricirks. 

1. Note that the background geometry enters the hypersurface deformation algebra ex- 
clusively through the definition of the normal vectors T(y) to each point X{y) of the 
hypersurface XCE), since these are obtained from the canonically defined normal co- 
vectors n{y) by virtue of the Legendre map L, which indeed is determined through 
P. Also observe that only the first commutation relation depends on the spacetime 
geometry, while the other two are fully independent of it. 

2. The hypersurface deformation operators are constructed such as to describe the change 
of functionals if one moves from one initial value hypersurface to another one near- 
by, if indeed the entire spacetime geometry is already known. Now the problem of 
finding dynamics for the spatial geometry is to find equations that provide the spatial 
geometry on a near-by hypersurface solely from the spatial geometry on the original 
hypersurface, without the entire spacetime geometry already being known. This is of 
course at most possible if one compensates for the lack of information about the geome- 
try around the hypersurface by prescribing additional initial data on the hypersurface, 
in form of canonical momenta 

associated with the purely spatial geometric data given by 

paia2 paia2a3 paia2...Q;degP 

where these are now no longer understood as induced from a known spacetime ge- 
ometry, but as independent tensors on E; to avoid any conceptual confusion, we 
marked these objects with a hat. The space of the tensor fields (P^,?!^) is called 
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the geometric phase space, and the requirement that the tta be canonically conjugate 
to the fixes the Poisson bracket on geometric phase space to 

6F 6G _ SG 6F 

where the observables F and G are functionals of the phase space variables, and a capi- 
tal Latin index A denotes the entire collection of indices (aia2, aia2(^3, ■ ■ ■ , ci\Ci2 ■ ■ ■ ^dcgp) 
and summation over a capital latin index is summation over all those spacetime indices. 

3. As discussed before, one may alternatively wish to construct the phase space of 
the purely spatial geometry seen by fields. The number of fields replacing the 
P"i"2, . . . p"i"2 -.a/ jj^ ^]-^g previous construction then depends on the precise structure 
of the geometric tensor G. For an area metric geometry {G°'^'^ — G"^"^ — —G^'^"^) in 
d spacetime dimensions, the initial data on a hypersurface are described by the three 
tensor fields 

together with canonically conjugate momenta 
The Poisson brackets are constructed accordingly. 




Definition. Dynamics for observables on the geometric phase space is given by the 
Hamiltonian 

jj^'-'y [?i(iV)+:D(iV"e„)] , 

where the superhamiltonian 7^ and supermomentum V are functionals of the geometric 
phase space variables (P^, tta) that represent the hypersurface deformation algebra by virtue 



of 



{niA),niB)} = P((degP - l)P^^iAd^B - Bd^A)da) , 
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Remcirks. 

1. This definition describes precisely what dynamics is all about: geometric data on an 
initial data hypersurface are evolved such that, if put on a neighbouring hypersurface 
(determined from the original hypersurface by the deformation fields N and N'^Ca), and 
this data then again put on a neighbouring hypersurface, and so forth, the entirety of 
spatial geometries on all these hypersurfaces constitutes an entire spacetime geometry. 
The representation requirement then ensures that this is consistent with the idea 
that the the data on the individual hypersurfaces are really nothing but the purely 
spatial geometries induced from that overall spacetime geometry, with all its required 
properties. 

2. With the mathematical technology for (generalized) spacetime geometries developed 
in these lectures, it can be shown from the third Poisson bracket relation that the 
supermomentum for the geometry seen by point particles is uniquely given by 

dcgP 
1=2 "^^ 

3. Prom the second Poisson bracket relation it is possible to show that the locahzed 
superhamiltonian 'H(y) := i^{5y) for the geometry seen by point particles is, first, a 
scalar density of weight one, and, second, decomposes into non-local and local parts 
according to 

H{y)[P, Tt] = ?ilocal(Z/)[P](7r) + Ur^on-loUyW . Tt] , 

of which the non-local part however is directly determined to be 

degP 

7inon-local(2/) = ^ [(dcg P-/)a^(P^"i-"^7r«,..,J-(deg P-1) / a^(P"-"^P"i'^V...aj] 
1=2 

and only the local part T^iocai remains to be determined from the first Poisson bracket. 

4. With the supermomentum and non-local part of the superhamiltonian already known, 
the first Poisson bracket relation finally presents a condition quadratic in the local 
part of the superhamiltonian. Remarkably, this is equivalent to a linear condition in 

the Legendre transform of ?iiocai in the variables, 

L{y)[P]{K) := iTA{y)[P]{K)K\y) - Uiy^UHmiK)) , 
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where the Legendre dual variables are given by 

For with these definitions, the remaining first Poisson bracket relation takes the form 
of the linear functional differential equation 



6P^{y) 



dL{x) j^B(^n 



dK^{x) 



K^{x)QB'''{x)dMy) 



dK^{x) 

where the coefficients R^^^^ , Qa^'^ and M^'^ contain only the configuration variables, 
but not their derivatives, 

^a^.-aiP ^ _(degP - I)pP'^^-»i + /(degP - l)p/5(«ip"2...a/) ^ 

-5f_,/(degP-l)P'^('^^5^-.;:SL, 
but the coefficients 5'^'' also containing their first partial derivatives, 

ga,...ajn ^ 7(degP - l)(degP - 2)P'^t("i9^P"2 + 7(degP - l)(degP - 7)9^pT{"ip«2...a,),. 

+ (degP - l)P^T5^P"i-"^ - 7(7 - l)(degP - I)2a^p7(aipa2...a,_ipa,)/. _ 

5. The linearity of the first Poisson bracket relation in L allows for a power series ansatz 

oo 

L(x)[P](7^) = Yl ^(^)(^' ■ ■ ■ )M...A,K^'ix) . . . K^'ix) , 

i=0 

SO that the problem of finding the local superhamiltonian, and thus the gravitational 
dynamics, reduces to solving the equations for the coefficients Gai...An that result from 
making this ansatz, and the corresponding equations are the subject of the following 
theorem. 



Theorem. The coefficients Ga, Gab, Gabc, ■ ■ ■ defining the local part of the superhamil- 
tonian for the geometry seen by point particles (and thus in conjunction with the already 
known non-local part and the supermomentum the entire gravitational dynamics) 
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(a) depend at most on the geometric variables and their partial derivatives up to (and 
including) second order 

(b) are completely determined by the representation of the deformation algebra equations 



(I) =2d^{GAR^^n + ^GaS""^ - 2d, 



dG 



-M^l'^) - 4a, 



dG 



ddf , P^ 



dP^ dd„P^ ^ ddl.P^ 



1X1/ 



(II) ={N + 1)! Gab^.-.b^R^""^ - N\ M^l") - 2N\ ^^-^^ g^M^'") 

dd(^i3\P^ ddfai.P^ 



dG, 



-{N - 2){N - 1)\ , 



ddlpP^^ 



N 



(III) =(7V + l)!^^^,...^,,^^" + (TV - 1)! J] 



dG 



a=l 



Bl...Ba...BN 

dd^P^'^ 



-2{N-iy.d. 



dGB,. 



^^2 pBN 



dd^P^ 



QQ2^pA 



—NN\Q(^Bi'^'^Gb2...Bn}m , 
which also include the two symmetry conditions 

dGsi Bn- 



(IV) 
(V) 



dG 



B\...Ba---Bf, 



gg2 pBa f)f)2 pBN 

7172 7172 



for all > 1, a = 1,...,A^ 



dG 

= ^^-^^ M-^l^) for all AT > , 

dr^ 

as well as the three invariance conditions 



f)f)2 pA 



deg P 



(VI) = ^ / P"2...a/(- 



7=2 

deg P 



dd"^ ,P<X2 -aip ' 

/it/) 



(VII) = Yl 



1=2 
degP 



J pa2--ai(tJ. Bi...Bjsi q pai...aj ^'-^Bi.-.-Bjv ^ Q pa2.-aiifJ, ^'-^i?i...-Bj\ 



dd^)P"2-aiP 



f)f)2 pai...ai 



(VIII) 



Tj pp02-I3i '^Gbi...Bn j^jq pp02...0i '^Gb^...Bn 
^ L dP02-l3,tx ^ QQ^p^2-Pw 



dG 



Bi ...-Bj\ 



ddpPP^-0' 



+ 1 d^sP^^^-^' ■ 



dG 



B^...Bj. 



dd^sPf^^■■■P'^' 



-2d P^^-l^' ^^^i -^N 

ddp^pp^-P' 



,Y)/iB2...Bj> 



Bi...B^_iM{/3r'...<^V^)' 



where rij is the number of small indices contained in the capital index Bi — ^[ 
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Remcirks. 

1. Solving this set of linear partial differential equations and consequently solving for 
the canonical momenta tt^ in terms of their Legendre duals is the mathematical 
problem of finding modified gravity theories for the geometry seen by point particles. 
For finding dynamics for the geometry seen by fields one proceeds fully analogously. 

2. For the case degP = 2 and dimM = 4, it is relatively straightforward to obtain that 
the above equations determine that the only non-vanishing coefficients are 

Gap ^ py^iRa(3 - l/^QapR) + (T^/ggafi , 

with the Ricci tensor Rajs and Ricci scalar R associated with the Lorentzian metric 
P"^ = g'^^ and four undetermined real integration constants k, A, p, cr, of which the 
last two, however, can be set to zero without changing the gravitational equations 
of motion. The remaining integration constants k and A are the gravitational and 
cosmological constant, respectively, and must be determined by experiment. This is 
Einstein's general relativity, the simplest gravity theory. 

3. In particular (i), the question whether there are any modified gravitational dynamics 
at all is the question of existence of solutions of the above system for any even deg P > 
2; (ii) the question whether there is only one modified gravity theory for each even 

dcg P > 2 amounts to the question of the uniqueness of solutions of the above system; 
(iii) the question of what the concrete gravitational dynamics are for each degP, given 
their existence, is the problem of finding explicit solutions for the above system. 
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